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(54) Propagating updates efficiently in hierarchically structured data under a push model 



(57) One embodiment of the present invention pro- 
vides a system that efficiently propagates changes in 
hierarchically organized data to remotely cached copies 
of the data. The system operates by receiving changes 
(402) to the data located on the server (204). and apply- 
ing (404) the changes to the data on the server (204). 
These changes are propagated to remotely cached 
copies of the data on a client (206. 208. 210. 212) in 
response to an event on a server, and independently of 
events on the client, by (1) determining differences 
(406) between the current version oi the data at the 
server and.an older copy of the data at the client, which 
the server has stored locally; (2) using the differences to 
construct (408) an update for the copy of the data. 



which may include node insertion and node deletion 
operations for hierarchically organized nodes in the 
data; and (3) sending (410) the update to the client 
where the update is applied to the copy of the data to 
produce an updated copy of the data. According to one 
aspect of the present invention, the act of determining 
differences, and the act of using the differences to con- 
struct the update both take place during a single pass 
through the data. According to another aspect of the 
present invention, the update for the copy of the data 
may include node copy, node move, node collapse, 
node split, node swap and node update operations. 
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Description 
BACKGROUND 

(00011 The present irv •• ■ .on relates to distributed 5 
computing systems and daiabases. More particularly, 
the present invention relates to a method and an appa- 
ratus that facilitates detecting changes in hierarchically 
structured data and producing con'esponding updates 
for remote copies of the hierarchically structured data. lo 
[0002] The advent of the Internet has led to the 
development of web browsers that allow a user to navi- 
gate through inter-linked pages of textual data and 
graphical images distributed across geographically dis- 
tributed web servers. Unfortunately, as the Internet t5 
becomes increasingly popular, the Internet often experi- 
ences so much use that accesses from web browsers to 
web servers often slow to a crawl. 
(0003) In order to alleviate this problem, a copy of a 
portion of a web document from a web server (docu- 20 
ment server) can be cached on a client computer sys- 
tem, or alternatively, on an intermediate proxy server, so 
that an access to the portion of the document does not 
have to travel all the way back to the document server 
Instead, the access can be serviced from a cached copy 25 
of the portion of the document located on the local com- 
puter system or on the proxy server. 
(0004] However, if the data on the document server 
is frequently updated, these updates must propagate to 
the cached copies on proxy servers and client computer 30 
systems. Such updates are presently propagated by 
simply sending a new copy of the data to the proxy serv- 
ers and client computer systems. However, this tech- 
nique is often inefficient because most of the data in the 
new copy is typically the same as the data in the cached 35 
copy In this case, it would be more efficient to simply 
send changes to the data instead of sending a complete 
copy of the data. 

[0005] This is particularly true when the changes to 
the data involve simple manipulations in hierarchically ^0 
structured data. Hierarchically structurec- data typically 
includes a collection of nodes containing data in a 
nunnber of forms including textual data, database 
records, graphical data, and audio data. These nodes 
are typically inter-linked by pointers (or some other type 45 
of linkage) into a hierarchical structure, which has 
nodes that are subordinate to other nodes, such as a 
tree although other types of linkages are possible. 
(0006] Manipulations of hierarchically structured 
data may take the form of operations on nodes, such as so 
node insertions, node deletions or node movements 
Although such operations can be succinctly stated and 
easily performed, there presently exists no mechanism 
to transmit such operations to update copies of the hier- 
archically structured data. Instead, existing systems first ss 
apply the operations to the data, and then transmit the 
data across the network to update copies of the data on 
local machines and proxy servers. 



SUMfVIARY 

[0007] One embodiment of the present invention 
provides a system that efficiently propagates changes 
in hierarchically organized data to remotely cached cop- 
ies of the data. The system operates by receiving 
changes to the data located on the server, and applying 
the changes to the data on the server. These changes 
are propagated to remotely cached copies of the data in 
response to an event on the server and iridependentty 
of the client, by (1) determining differences between the 
current version of the data at the server and an older 
copy of the data at the client, which the server has 
stored locally: (2) using the differences to construct an 
update for the copy of the data, which may include node 
insertion and node deletion operations for hierarchically 
organized nodes in the data; and (3) sending the update 
tc the client where the update is applied to the copy of 
the data to produce an updated copy of the data. 
According to one aspect of the present invention, the 
act of determining differences, and the act of using the 
differences to construct the update both take place dur- 
ing a single pass through the data. According to another 
aspect of the present invention, the update for the copy 
of the data may include node copy, node move, node 
collapse, node split, node swap and node update oper- 
ations. 

BRIEF DESCRIPTION OF THE RGURES 

10008] 

FIG. 1 illustrates a computer system including a 
■ web browser and a web server in accordance with 
an embodiment of the present invention. 
FIG. 2 illustrates a computer system including a 
server that automatically updates local copies of 
documents in accordance with another embodi- 
ment of the present invention. 
FIG: 3 is a flow chart illustrating how a client 
requests an update from a server in accordance 
with an embodiment of the present invention. 
FIG. 4 is a flow chart illustrating how a server auto- 
matically updates local copies of documents in 
accordance with an embodiment of the present 
invention. 

FIG. 5 is a flow chart illustrating how the system 
creates updates for a new copy of hierarchically 
structured data in accordance with an emkjodiment 
of the present invention. 

FIGs. 6A-6I illustrate the steps involved in creating 
updates to transform a document tree T1 into a 
document tree T2. 

DETAILED DESCRIPTION 

[0009] The following descrption is presented to 
enable any person skilled in the art to make and use the 
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invention, and is provided in the context of a particular 
application and its requirements. Various modifications 
to the disclosed embodiments will be readily apparent to 
those skilled In the art. and the general principles 
defined herein may be applied to other embcdiments 
and applications without departing from the spirit and 
scope of the present invention. Thus, the present Inven- 
tion is not intended to be limited to the embodiments 
shown, but is to be accorded the widest scope consist- 
ent with the principles and features disclosed herein. 
[0010] The data structures and code described in 
this detailed description are typically stored on a com- 
puter readable storage medium, which may be any 
device or medium that can store code and/or data for 
use by a computer system. This includes, but is not lim- 
ited to, magnetic artd optical storage devices such as 
disk drives, magnetic tape. CDs (compact discs) and 
DVDs (digital video discs), and computer instruction sig- 
nals embodied in a carrier wave. For example, the car- 
rier wave may carry information across a 
communications network, such as the Internet. 

Computer System 

[0011] FIG. 1 illustrates a computer system includ- 
ing a web browser and a web server in accordance with 
an embodiment of the present invention. In the illus- 
trated embodiment, network 102 couples together 
server 104 and client 106. Network 102 generally refers 
to any type of wire or wireless link between computers, 
inclLxJing, but not limited to. a tocal area network, a wide 
area network, or a combination of networks. In one 
efTtoodiment of the present invention, network 102 
includes the Internet. Server 104 may be any node cou- 
pled to network 1 02 that includes a mechanism for serv- 
icing requests from a client for computational or data 
storage resources. Client 106 may be any node coupled 
to network 102 that includes a mechanism for request- 
ing computational or data storage resources from 
server 104. 

[0012] Server 104 contains web server 112. which 
Stores data for at least one web sire in the form of inter- 
linked pages of textual and graphical information. Web 
server 112 additionally includes a mechanism to create 
updates for remotely cached copies of data from web 
server 112. 

[0013] Web server 1 12 stores textual and graphical 
information related to various websites in document 
database 116. Document database 1 16 may exist in a 
number of locations and in a number of forms. In one 
embodiment of the present invention, database 116 
resides within the same computer system as sewer 104. 
In another embodiment, document database resides «t 
a remote location, and is accessed by sewer 104 
through network 102. Note that portions of document 
database 116 may reside in volatile or non-volatile sem- 
iconductor memory Alternatively portions of document 
database 116 may reside within rotating storage 



devices containing magnetic, optical or magneto-optical 
storage media. 

[0014] Client 106 includes web browser 114. which 
allows a user no viewing display 108 to navigate 
5 through various websites coupled to network 102. Web 
browser 114 stores cached copies 118 of portions of 
website documents in local storage on client 106. 
[0015] During operation the system illustrated in 
FIG. 1 operates generally as follows. In connmunicating 
10 with web browser 114. user 110 generates an access to 
a document In web sewer 112. in processing the 
access, web browser 114 first examines cached copies 
1 18 to determine if the access is directed to a portion of 
a web document that is already cached within client 
15 106. If so, client 106 makes an update request 120, 
which is transferred across network 102 to sewer 104. 
In response to the request, sewer 104 generates an 
update 122. which is transferred to web bra^^er 114. 
Update 122 is then applied to the cached copies 118 in 
20 order to update cached copies 118. Finally, the access 
is allowed to proceed on the cached copies 1 18. 
[0016] Note that although the example illustrated in 
FIG. 1 deals wrth web documents for use with web 
browsers and web sewers, in general the present inven- 
ts tion can be applied to any type of data. This may include 
data stored in a hierarchical database. This may also 
include data related to a directory service that supports 
a hierarchical name space. 

[0017] Also, server 104 and web server 112 may 
30 actually be a proxy server that stores data in transit 
between a web server and web browser 114. In this 
case, the invention operates on communications 
between the proxy server and web browser 114. 
[0018] In a variation on the embodiment illustrated 
35 in FIG. 1 , client 106 is a "thin client" with limited memory 
space tor storing cached copies of documents 118. In 
this variation, when .client 106 requests a document, 
only a subset of the document that client 106 Is actually 
viewing sent from server 104 to client 106. This subset 
<o is adaptively updated as client 105 navigates through 
the document. 

[0019] In another variation on the at>ove embodi- 
ment, documents from document datat>ase 116 are 
tree- structured. In this variation, documents or portions 
45 of documents that are sent from server 1 04 to client 1 06 
are first validated to ensure that they specify a proper 
tree structure before they are sent to client 106. This 
eliminates the need for client 106 to validate the data. 
(Validation is typically performed by parsing the data. 
so constructing a tree from the data, and validating that the 
• tree is properly structured.) Reducing this work on the 
client side can be particularly useful for thin clients, 
which may lack computing resources for performing 
such validation operations. 
55 [0020] FIG. 2 illustrates a computer system includ- 
ing a server thai automatically updates local copies of 
documents in accordance with another embodiment of 
the present invention. In the embodiment illustrated in 
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FIG. 2. network 202 couples together server 204 with 
workstation 205. persona! computei 208. network com- 
puter 210 and personal organizer 212. Network 202 
generally refers to any type of wire or wireless link 
between computers, including, but not limited to. a local 
area network, a wide area network, or a combination of 
networks. In one embodiment of the present Invention, 
network 202 Includes the Internet Server 204 may be 
any node coupled to network 202 that includes a mech- 
anism for servicing requests from a client lor computa- 
tional or data storage resources, Servei 204 
communicates with a number of clients, including work- 
station 206. persona! computer 208. network computer 
210 and personal organizer 212. in general, a clieni 
may include any node coupled to network 202 that con- 
tains a mechanism for requesting computational or data 
storage resources from server 204. Note that network 
computer 21 0 and personal organizer 212 are both "thin 
clients." because they have rely on servers, such as 
server 204 for data storage and computational 
resources. Personal organizer 212 refers to any of a 
class of portable personal organizers containing com- 
putational and memory resources. For example, per- 
sonal organizer 212 might be a PALMPILOT'" 
distributed by the SCOM Corporation of Sunnyvale. Cal- 
ifornia. (PalmPilot is a trademark of the 3COM Corpora- 
tion). 

[0021] In the illustrated embodiment, workstation 

206. personal computer 208. network computer 210 
and personal organizer 212 contain cached documents 

207. 209. 211 and 213. respectively. Cached docu- 
ments 207. 209, 211 and 213 contain locally cached 
portions of documents from server 204, 

[0022] Server 204 is coupled to document database 
214. which includes documents to be distributed to cli- 
ents 206, 208. 210 and 212. Document database 2U 
may exist in a number of locations and in a number of 
forms. In one embodiment of the present invention, doc- 
ument database 214 reskjes within the same computer 
system as server 204. In another embodiment, docu- 
ment database resides at a remote locatior that is 
accessed by server 204 across network 202. Portions of 
document database 214 may reside in volatile or non- 
volatile semiconductor memory. Alternatively, portions 
of document database 214 may reside within rotating 
storage devices containing magnetic, optical or magne- 
tooptical storage media. 

[0023] Server 204 includes publishing code 205. 
which includes computer code that disseminates infor- 
mation across network 202 to workstation 206. personal 
computer 208. network computer 210 and personal 
organizer 212. Publishing code 205 includes a mecha- 
nism that automatically creates updates for locally 
cached copies of documents from document database 
214 stored in clients 206. 208. 210 and 212. 
[0024] During operation, the system illustrated in 
FIG. 2 operates generally as follows. Publishing code 
205 periodically receives new content 230, and uses 



new content 230 to update documents within document 
database 2i4. Publishing code also periodically con- 
structs updates for remotely cached copies of docu- 
ments from document database 214. and sends these 
5 updates to clients, such as workstation 206. personal 
computer 208. network computer 210 and personal 
organizer 212. Note that these updates do not simply 
contain new versions of cached documents, but rather.^ 
specify changes to cached documents. 

Updating Process 

[0025] FIG, 3 is a tbw chart Illustrating how a client 
requests an update from a server In accordance with an 

15 embodiment of the present invention. This flow chart 
describes the operation of the invention with reference 
to the embodiment illustrated in FIG. 1. First, the system 
receives a request access the data (step 302). In FIG. 1 , 
this corresponds to user 110 requesting access to a 

30 web page or a portion of a web page through web 
browser 1 14 on client 106. Next, the system determines 
if client 106 contains a copy of the data (step 304). This 
corresponds to web browser 1 1 4 looking in cached cop- 
ies 1 18 for the requested data. If the data is not present 

25 on client 106. the system simply sends a copy of the 
requested data from server 104 to client 106 (step 306). 
and this copy is stored in cached documents 1 1 8. 
[00261 If a copy of the data is present on client 106. 
client 106 sends ari update request 120 to server 104 

30 requesting an update to the copy (step 308). In one 
embodiment of the present invention, update request 
120 includes a time stamp indicating how long ago the 
previous update to cached documents 1 18 was created. 
In response to update request 120. server 104 deter- 

35 mines differences between the copy of the data on client 
106. and the data from document database 116 (step 
310). These differences are used to construct an update 
122. which specifies operations to update the copy of 
the data on client 106 (step 312). Note that if client 106 

40 sends a timesiamp along with the request in step 308. 
the timestamp can be used to determine the differences 
between the data on server 104 and the cached copy of 
the data on client 106. In another embodiment of the 
present invention, server 104 saves update 122, so that 

45 server 104 can send update 122 to other clients. In yet 
another embodiment, server 104 keeps track of 
changes to the data from document database 116 as 
the changes occur; these changes are aggregated into 
update 122. This eliminates the need to actually find dif- 

50 ferences between the data from document database 
1 16 and the cached copy of the data on client 106. 
[0027] Also note that the operations specified by 
update 122 may include manipulations of nodes with in 
the data. For example, if the data is hierarchically organ- 

55 ized as nodes in a tree structure, the update may spec- 
ify tree node manipulation operations, such as move, 
swap, copy, insert and delete operations for leaf nodes. 
The data may also specify sub-tree move, copy. swap. 
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insert, delete operations, as well as interna! node split- 
ting and internal node collapsing operations. Transmit- 
ting such node manipulation operations, instead of 
transmitting the data that results after the node manipu- 
lation operation? have been applied to the d^ta. can s 
greatly reduce the amount of data that must be transmit- 
ted to update a copy of the data on client 106. 
[0028] The update may additionally include a Multi- 
purpose Internet Mail Extensions (MIME) content type 
specifying that the update contains updating operations w 
tor hierarchically organized data. This informs a client 
receiving update 122 that update 122 contains update 
information, and not regular data. The MIME content 
type may specify that update 122 contains updating 
information that has been validated by sewer 104 so /5 
that client 106 does not have to validate update 122. 
[0029] In one embodiment of the present invention, 
the steps of determining the differences (step 310) and 
constructing the update 122 (step 312) take place con- 
currently during a single pass through the data. This 2o 
technique has performance advantages over perform- 
ing these steps separately in two passes through the 
data. 

[0030] Next, update 122 is sent from sewer 104 to 
client 106 (step 314). and client 106 applies update 122 25 
to the copy of the data (step 316). In one embodiment of 
the present invention, the copy of the data is stored in 
semiconductor memory within client 106, and hence 
applying update 122 to the copy of the data involves fast 
memory operations, instead of slower disk access oper- so 
ations, 

[0031] Finally the original access to the data (from 
. step 302) is allowed to proceed, so that user 110 can 
view the data on display 108. The above process is 
repeated for successive accesses to the copy of the as 
data on client 

[0032] Note that although the illustrated embodi- 
ment of the present invention c^erates in the context of 
a web browser and a web server, the present invention 
can be applied in any context where updates to data 
have to be propagated to copies of the data. For exam- 
ple, the present invention can be applied to distributed 
database systems. 

[0033] FIG. 4 is a flow chart illustrating how sewer 
204 (from FIG. 2) automatically updates local copies of 45 
documents in accordance with an embodiment of the 
present invention. This embodiment is an implementa- 
tion of a "push" model, in which data is pushed from a 
sewer 204 to clients 206. 208. 210 and 212 without the 
clients having to ask for the data. This differs from a so 
"request " model, in which the clients have to explicftly 
request data before it is sent as is illustrated in FIG. 1. 
[0034] The ftow chart illustrated in FIG. 4 describes 
the operation of the invention with reference to the 
embodiment illustrated in FIG. 2. First, sewer 204 55 
receives new content 230 (step 402). This new content 
230 may take the form of live updates to document data- 
base 214, for example in the form of stock pricing infor- 



mation. New content 230 is used to update documents 
or other data oljjects within document database 214 on 
server 204 (step 404), 

[0035] Next, publishing code 205 within sewer 204 
deterrnines diHefences between the data in document 
database 214 and copies of the data on clients (sub- 
scribers) 206. 208. 210 and 212. (step 406). These dif- 
ferences are used to construct updates 216. 218, 220 
and 222. which specify operations to change copies of 
the data on clients 206. 208. 210 and 212. respectively 
(step 408). 

I0036J Updates 215.218. 220 and 222 may specify 
operations that manipulate nodes within the data. For 
example, if the data is hierarchically organized as nodes 
in a tree structure, updates 216. 218. 220 and 222 may 
specify tree node manipulation operations, such as 
move, swap. copy, insert and delete operations for leaf 
nodes. The data may also specify sub-tree move, copy 
swap, insert, delete operations, as well as internal node 
splitting and internal node collapsing operations. Trans- 
mitting such node manipulation operations, instead of 
transmitting the data after node manipulation operations 
have been applied to it. can greatly reduce the amount 
of data that must be transmitted to update copies of the 
data on clients 206. 208. 210 and 2i2. 
[0037] In one embodiment of the present invention, 
the steps of determining the differences (step 406) and 
of constructing updates 216. 218. 220 and 222 (step 
408) takes place concurrently during a single pass 
through the data. This can have a significant perform- 
ance advantage over performing these steps in two sep- 
arate passes through the data. 
[0038] Next, updates 216. 218. 220 and 222 are 
sent from server 204 to clients 206. 208. 210 and 212 
(step 410). respectively Clients 205. 208. 210 and 212 
apply updates 216. 218. 220 and 222 to their local cop- 
ies of the data 207. 209, 2 11 and 2 1 3. respectively (step 
412). In one emtxxjiment of the present invention, these 
updates are applied to are applied the local copies 207, 
209, 211 and 213 "in memory.' without requiring disk 
accesses. This allows to updates tr; be performed very 
rapidly. 

[0039] The above process is periodically repeated 
by the system in ader to keep copies of the data on cli- 
ents 206, 208. 210 and 212 at least partially consistent 
with the data on server 204. This updating process may 
repeated at any time interval from, for example, several 
seconds to many days. 

Process of Creating Updates 

[0040] FIG. 5 is a flow chart illustrating how the sys- 
tem aeates updates at the server for a new copy of hier- 
archically structured data in accordance with an 
embodiment of the present invention. This embodiment 
assumes that the data is hierarchically organized as a 
collection of nodes in a tree structure. This tree struc- 
ture includes a root node that can have a number ol chil- 
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dren. These children can also have children, and so on, 
until leaf nodes, which have no children, are reached. 
Note thai the below-described process for creating 
updates requires only a single pass through the data 
During this single pass the system determines differ- 
ences between old and new trees and creates corre- 
sponding updates to convert the old tree into the new 
tre«. Thi9 oDmlnites th« need tor a eeparata time-con- 
suming pass through the data to create updates from 
differences. 

[0041] The system starts with an old tree (old J) 
and a new tree (newj). The system first matches leaf 
nodes of oldj and new_t (step 502). in doing so. the 
system may look for exact matches or partial matches of 
the data stored in the leaf nodes. In the case of partial 
matches, if the quality of a match is determined to be 
above a preset threshold, the leaf nodes are considered 
to be "matched," Next, the system generates deletion 
operations to remove nodes from old J which are not 
present in new J (step 504). 

[0042] In the next phase, the system repeats a 
number of steps (506, 508. 510 and 512) for ascending 
levels of the tree. First, for a given level, the system gen- 
erates node insertion operations for nodes that are 
present in new J but not in old J (step 506) Also, if the 
position of a node in oldJ is different from the position 
of the same node In newJ. the system generates a 
move operation, to move the node from its position in 
old J to Its new position in new J (step 508), Addition- 
ally, if a parent node in old^t does not have all of the 
same children in newj, the system generates a node 
split opwation for the parent, splitting the parent node 
into a first parent and a second parent (step 510). The 
first parent inherits all of the children that are present in 
nevi'j. and the second parent inherits the remaining 
children. If a parent node in oldJ has all of the same 
children and additional children in newj, the system 
generates a node collapse operation to bring all the chil- 
dren together in newj (step 512). 
[0043] Additionally, if all of the children of a first par- 
ent in OldJ move to a second parent in newj, the s, s- 
tem generates a node collapse operation to collapse the 
first parent into the second parent so that all of the chil- 
dren of the first parent are inherited by the second par- 
ent. 

{0044] The system repeats the above-listed steps 
506. 508. 510 and 512 until the root of the tree is 
reached. At this point all of the operations that have 
been generated are assembled together to create an 
update that transforms old J into newj (step 514). 

[0045] Let us consider the example tree illustrated 
in Figure 6A. This tree may represent a document con- 
sisting of sections, paragraphs and individual sentences 
containing parsable character data. Assume that the 
document grammar also allows documents to contain 



non-character data, say numeric data, as is represented 
by the leaf rxxie identifier 'd'. All nodes in FIG. 6.A 
include a name (tag), a value, and an associated value 
identifier. Since the leaf nodes actually contain data. 

5 value identifiers are assigned to them before the proc- 
ess Starts; whereas, for an interna! node, a value identi- 
fier l8 assigned during the comparison process based 
upon the vaiut of identifiers of th« int»rnal nodo's chil- 
dren. Note that in some embodiments of the present 

)o invention, the tree data structure as represented in 
memory may conform to the World Wide Web Consor- 
tium document object model (W3CD0M). 
[0046] Additionally, in some embodiments of the 
present invention, the hierarchically organized data 

75 includes data that conforms to the Extensible Markup 
Language (XfvIL) standard. In other embodiments of the 
present invention, the hierarchically organized data 
includes data that conforms HyperText Markup Lan- 
guage (MTML) standard, and other markup language 

20 standards. 

Notationai Semantics 

[0O47] We represent each leaf node by the path 
25 from root node to the leaf node containing the position 
of each node along the path. Hence, the notation for 
each of the leaf nodes in FIQ. 6A is as follows: 

DO.SeO.PO.SO (left-most node) 

30 DO.SeO.PO.Sl 

DO.SeO.P0,S2 

D0.Se0.P0.S3 

DO.SeO.Pl. SO 

DO.Sel.NO 
35 D0.Se2.P0.S0 

D0.Se2.P1.S0 

D0.Se2.Pl.Sl 

D0.Se2.P2.S0 

D0.Se2.P2.Si (right-most node) 

4G 

The above notation is used to locate and represent any 
node in the tree, whether it be a leaf node or internal 
node. 

[0048] The notationai semantics for each of the tree 
45 transformation operations is as follows: 

• MOV(D0.Se0.P0.S2. D0.Se2.Pl .30). In FIG. 6A. 
this operation moves the leaf node with value iden- 
tifier *a'. Note that a similar operation can be used to 
50 represent a movement of an internal node. In the 
case of an internal node, the entire sub-tree moves. 
Thus, the movement of an irxiividual node or a sub- 
tree can be an inter -parent move or an intra-parent 
move. 

55 • SWP{DO.SeO.P0.S2. DO.SeO.PO.Sl). Tnis opera- 
tion Is permitted only in the case of nodes that 
share a common parent (i.e.. intra-parent only). The 
operation swaps the position of the affected nodes. 
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under the common parent. In the case of internal 
nodes, entire sub-trees are swapped. 

* CPY(DO.SeO.PO. D0.Se2.P2). This operation repli- 
cates a node by making an identical copy on the 

node. In the case of internal nodes, the entire sub- 
tree is copied. 

' INS(DO.SeO.PO.SO, a', (data)). This operation 
inserts a node in the tree at the given position and 
assigns to it a value identifier a' along with the 
{data). In the case of an internal node. 
{data}assigned contains a null value. 

* DEL(DO.SeO.PO) This Operation deletes a node and 
all of its children. 

* SPT(D0.Se0.P0. I) This operation splits a parent 
node into a first ncxje and a second node. All of the 
children of the parent node starting at position i are 
transferred to the first node. The remaining children 
are transferred to the second node. The first node 
gets the same tag type as the original parent node. 

* CLP(D0.SeO.PO. DO.SeO.Pl). This operation col- 
lapses the contents of a first node and a second 
node. The resulting node gets the same tag type as 
the first node. The children of the second become 
the right-most children of the resulting node. 

* UPD{D0.Se0.PaS2. {delta}). This operation speci- 
fies a change{delta)to the contents of a leaf node. 
The {delta) itself describes how to apply (or merge) 
the change. 

[0049] Th example described below generates a 
.. set of operations to transform an old tree Tl (FIG. 6A) 
into a new tree T2 (FIG. 6B). Note that in this example 
. the leaf nodes contain actual data, and the internal 
[ nodes simply contain tags which organize and describe 
the data. There are three phases in the process, includ- 
ing: (1) matching the leaf nodes in Tl and T2; (2) delet- 
ing nodes in T1 with no match in T2; and (3) modifying 
or moving nodes the remaining nodes to create Tl . 

Phase 1 : Matching Leaf Nodes 

[0050] The first st^ is to generate a unique identi- 
fier for each of the leaf nodes in T2 based on the content 
of the leaf node. This can be accomplished by using a 
hash function to generate a unique identifier for each of 
the leaf nodes. If two leaf nodes have the same content, 
then the hash function generates the same identifier. If 
two leaf nodes have the same identifier, it will not cause 
problems, because the process uses the root node to 
leaf node path to identify the individual nodes. 
(0051 } Next, the process assigns value identifiers to 
leaf nodes of Tl. For a given leaf node in Ti. the proc- 
ess uses a hash function to generate a unique identifier, 
which matches one of the leaf node identifiers in T2. if 
the identifier generated does not match any of the iden- 
tifiers in T2. then process attempts to find a closest 
matching leaf node in T2. based on some matching cri- 
teria. For example, the process may use the Longest 



Common Sub-sequence (LCS) algorithm ("Data Struc- 
tures and Algorithnns." Aho. Alfred V,, Hopcroft. John E. 
and Ullm'an, Jeffrey D.. Addison-Wesley, 1983. pp. 189- 
194) to determine a percentage match between the 

5 contents of leaf nodes in T1 and T2. The matching crite- 
rion can be flexible. For example, the matching criterion 
may specify a minimum of 30% commonality in order for 
the leaf nodes to be matched. 
(0052J Allowing matches to be made on an accept- 

w able matching criteria provides a measure of flexibility. 
In case a given leaf node s content has been only 
slightly modified in going from Ti to T2, the system sim- 
ply matches the node with its modified version in T2. 
The process subsequently makes the leaf nodes con- 

15 sistent through the UPD(node. delta) operation. How- 
ever, if the commonality between leaf nodes being 
matched does not satisfy the matching criterion, the 
process assigns a unique value identifier to the leaf 
node in Tl , which indicates that the leaf node has been 

20 deleted. 

[0053J In the worst case, the time complexity of find- 
ing a match between the leaf nodes will beO{K^), where 
K is the number of unique leaf node identifiers in Ti and 
T2. In the best case, where the leaf nodes in T1 and T2 
25 match in a straightlonward manner, the complexity will 
be 2*K. However, the number of changes in a document 
from one version to another is typically fairly small, in 
which case only a few leaf nodes need to be matched 
based on the weak matching criteria. 

Phase 2: Deletion phase 

[0054] After the matching phase is complete, there 
may be some leaf nodes in Ti. which are not matched 
35 to nodes in T2. These unmatched are deleted as fol- 
lows. 

• For unmatched leaf nodes in Tl (from left to hgh\), 
create a delete operation. such as 

4G DEL(D0.Se2.P2.S0). 

Reduce the number of delete operat'ons. by replac- 
ing them with sub-tree delete operations, if possi- 
ble. If all children belonging to a parent are to be 
deleted, the delete operation of each of tfie children 

4S can be replaced by a single delete operation of the 
parent node. This involves scanning the deletion 
list, looking for common parents. If Tl has K levels, 
at most K-1 scans are needed to identify a common 
parent for deletion. Notice that while scanning the 

so ith level, unreduced nodes in the i+1 level can be 
ignored, sirwe they cannot be further reduced. 
After the reductions are performed, the final dele- 
tion list is repositioned, because deleting a node at 
position '0' alters the relative positioning of adjacent 

55 nodes. Hence, if two delete operations are to be 
performed on nodes that have a common parent, 
then the second delete operation needs to be 
altered to reflect the change in position of the sec- 
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ond node to be deleted. 

[0055] * In the instant example, leaf nodes ) and 
i in FIG. 6A are unmatched. In accordance witr : ~ first 
step, the system creates following delete opaatons, 5 

OEL(DO.SeO.PO.SO). 
DEL(DO.SeO.P0.Sl). 
DEL(D0.Se2.P2.S0). 
0EL(D0.Se2.P2.Si). 

In the second step, scanning left to right (scan level 4). 
the system notices that all of D0.Se2.P2"s children are 
to be deleted. By reducing the individual delete opera- 
tions '•DEL(DaSe2.P2.S0)" and •'DEL(D0.Se2.P2.Sl)- j5 
into a single delete operation of the parent 
"DEL(D0.Se2.P2)" we are left the following delete oper- 
ations. 

DEL(DO.SeO.PO.SO). 2G 

DEL(DO.SeO.PO.Sl). 

DEL(D0.Se2.P2). 

[0056] Continuing with the level 3 scan, the system 
notices that the only eligible delete operation for reduc- 25 
tion is DEL{D0.Se2.P2). since the other delete opera* 
tions DEL(DO.SeOPO.SO) and DEL{DO.SeO.PO,Sl) are 
at level 4. Since D0.Se2.P2*s parent has other children 
which do not participate in the delete operation, the 
reduction ends at scan level 3. . 3o 
[0057] In the third step, the system checks to see if 
applying the first delete operation will affect the relative 
node position of any other delete operation. This 
involves looking for nodes having the same parent as 
the node being deleted. If such a node exists, the sys- 35 
tern adjusts its node position accordingly Note that the 
entire deletion list need not be scanned to identify sib- 
ling nodes, because the inherent ordering in the dele- 
tion list ensures that deletion operations tor sibling 
nodes will be dose together in the deletion list. 4G 
[0056] Continuing with the example, the system 
notices that applying the delete operation 
DEL(DO.SeO.PO.yO) will affect the relative positioning of 
sibling node DO.SeO.PO.tl. So. the system adjusts the 
position of its sibling (See FIG. 6C). Hence, the final 45 
deletion list becomes, 

DEL(DO.SeO,PO.SO), 
DEL(DO.SeO.P0.SO). 

DEL(D0.Se2.P2). so 

Phase 3: Modification Phase 

[0059] The modification phase brings together the 
children of internal nodes, in a bottom-up fashion. This ss 
involves scanning ail the nodes from the bottom-nx)st 
l&/el (furthest from the root), and scanning each level 
until level zero is reached. Note that the identity of each 



internal node is established by the collective identity of 
its children. For example, if a parent node s children are 
identified as *a' and "b* respectively then the identity of 
the parent is *ab.' 

[0050] Also, if a parent node is left with no children 
as a result of a move operation, the parent node is 
deleted. Furthermore, in the special case where there is 
a skewed tree or sub-tree of nodes having Just one 
child, i.e.. a->b->c->d. when node 'd' is deleted, node 'c' 
is also be deleted. This action is repeated until node 'a* 
is deleted as well. Instead of generating an individual 
delete operation for each one of the nodes, the chain of 
delete operations is reduced to a single delete operation 
of the grandest common parent of all nodes being 
deleted. 

[0061] Pseudo-code for one embodiment of the 
modification phase appears below. 

For each level J in T2 (leaf to the root) { 

1. TO.BE^COMPLETED^LIST = list of all the 
node value identifiers at levelj in T2. 

2. H the node in the 
TO^BE^COMPLETED^LIST is the root node, 
find the matching node T in TV. If 'f happens is 
a root node, break from the loop. Else, partition 
TV Into two nodes, such that the sub-tree 
rooted at V is moved away from TV, and 
becomes another tree (Ti"). Next, delete the 
source partition (TV) by deleting its grandest 
common parent (the root). TI *" and T2 are now 
identical. 

1 (end of for loop) 

3. Pick one of the nodes *k' from 
TO^BE^GOfy^PLETE DELIST, typically the left- 
most node. SIBLING^LiST = siblings of V. 
including 'k'. Note that we use the term 'node' in 
place of a node identifier, for convenience. 

4. If none of the nodes in the SIBLING.LIST 
have a matching node in TV. create a parent 
node *p* in T1 having the same tag type as the 
one in T2 (i.e. same as the parent of the nodes 
in the sibling list in T2). Insert all of the nodes in 
the sibling list into the newly created parent 
node in TV. Next, move the newly created node 
(along with its children) to be the child of any 
internal node, preferably one of the par en-, 
nodes at leveIJ-2. if such a level exists. 

5. Let S be the subset of nodes In the 
SIBLING^LIST that have a match In TV. Find a 
parent node *p' in TV, which has the nnost sib- 
lings in the SIBLING^LtST. 
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Move the rest of the matched nodes in S. 
to bejhe children of 'p*. If any subset of 

nodes being moved have a common par- 
ent 'q'. and if 'q' has no other children, then 
collapse 'q' into 'p'. Else, Individual nodes 5 
are moved by separate move operations. 
. The unmatched nodes in the 
SIBLING_LIST are inserted into 'p'. 
Order the children of 'p* through swap 
operations. At the erd of the swaps, all the w 
children of 'p' which do not happen to be 
the children of Its peer. If any. are gathered 
in the right-most corner. If there are such 
children, then a node split operation is per- 
formed, so that 'p' has exactly the same 75 
children as its peer. The newly created 
node (sub-tree) is at the same level as 'p' 
and has the same parent as p*. Also, the 
tag type of 'p' is changed to be the same as 
its peer in T2, if K is different. so 

6. Assign a node identity to 'p*. which is the col- 
lective identity of its latest children. Similarly, 
assign an identical identity to the peer node of 

'p' in T2. B5 

7. TO.BE^COMPLETED^LIST 
TO.BE.COMPLETED.LIST'SIBLING LIST. 

8. If TO BE COMPLETED LIST Is not equal to 30 
NULL, then return to step 2. else continue. 

[0062] Note that the above node movement opera- 
tions cause changes in the relative positioning of sibling 
nodes. Hence, the node operations generated by the 3S 
process should take into account the positional changes 
caused by node movements. 
[0063] The system now applies the modification 
algorithm on TV from FIG 6C. 

40 

Level 3 scan 

[0064] Applying steps 1 and 2. 
TO_BE_COMPLETEP^LIST « {g. c. f. e. b. a. z] and 

SIBLING^LIST = {g. c). The system locates the children 45 
"g' and 'c' in TV. and chooses Da.Se2.Pl to be the par- 
ent. Applying step 4, the system notices that nodes 
D0.Se2.P1 and DO.SeO.P1 need to be collapsed. This 
brings together all the nodes in the SIBLING_LIST 
under a comnnon parent (See FIG. 6D). so 

CLP(O0.Se2.Pl. D0.Se0.Pl) 

[0065] Next, the system uses swap operations to 
re-order the nodes (see FIG. 6E), 55 

SWP(D0.Se2.P1.Sl, D0.Se2.P1.S0) 
SWP(O0.Se2.P1.S2. D0.Se2.Pi.Sl) 



[0066] Next, a split operation is performed to move 
away children which do not truly belong to the parent 
(see FIG. 6F) 

SPT(D0.Se2.Pl. 2) 

[0067] Applying step 5. the system generates an 
identity for D0.Se2.P1 and its peer in T2 (see FIG. 6G). 
Though T2 is not shown, it is assumed that the identity 
has been assigned 

[0068] Applying step 6. the system determines that 
TO_BE,COMPLETED_LIST = {f. e. b. a. 2). Since 
TO_BE_COMPLETED_LIST is not empty, the system 
returns to step 2. SIBLING _LIST = (f). Step 3 and 4 do 
not produce any changes. Step 5 assigns an identity to 
D0.Se2.P2. Step 6 removes T from 
TO,BE.COMPLETED_.LIST Repeating the same, the 
system eliminates 'd'. and *e' from 
TO_BE_COMPLETED^LlST 

[0069] At this point. TO_BE_COMPLETE DELIST * 
(b. a. 2) and SIBLING^LIST « (b. a. 2). Step 4 selects 
node DO.SeO.PO as a matching node. At this point, node 
•2* in the SIBLlNG^LfST is unmatched in T2. Hence, the 
system inserts node '2'. Next, the system applies swap 
operations to order the children of DO.SeO.PO. Now. 
TO_BE_COMPLETED LIST is NULL (see FIG. 6H). 

INSCDO.SeO.PO.SO. z. (data)) 
SWP(DO.Se0.P0.S2. DO.SeO.PO.SO) 

Level 2 scan 

[0070] Applying steps 1 and 2 the system deter- 
mines TO.BE^COMPLETED^LIST = {gc, f. e. baz) and 
SIBLING^ LIST = (gc. f). Applying step 4. the system 
chooses D0.Se2 as the parent. The system next applies 
swap operations to order the children, and then split the 
parent D0.Se2 to move away children that do not t:>elong 
to D0.Se2. Applying step 5, the system generates iden- 
titles tor D0.Se2 and its peer in T2 (see FIG. 61). 

SWP(D0.Se2.P0, D0.Se2.Pl) 
SWP(D0.Se2.Pl. D0.Se2.P2) 
SPT(D0.Se2, 2) 

[0071] Now. TO^BE^COMPLETED^ LIST =: {e. 
baz] and SIBLING_.LIST = {e. ba2). Applying step 4. the 
system chooses DO.SeO as the parent. Since, P(e) is 
the only child, the system collapses DO.SeO and 
DO.SeS. and then re-orders the children through swap 
operations. Applying step 5. the system generates iden- 
tities for DO.SeO and its peer in T2 (see FIG. 6J). 

CLP(DO.SeO. D0,Se3) 
SWP(DO.SeO.PO. DO.SeO.Pl) 
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Level 1 scan 

(0072] Applying steps 1 and 2 the system deter- 
mines TO.BE^COMPLETED^LIST = (ebaz. d. gcf) and 
SIBLING„LIST = {ebaz. d, gcf). Step 4 selects DO as 
the parent, and applies the swap operations to re-order 
Its children, which produces T2 (see FIG 6K). 

SWP(DO.SeO. DO.Sei) 

[0073] Hence, the final set of transformations to 
transform T1 to T2 is: 

DEL(DO.SeaPO.SO), 
DEL(DO.SeO.PO.SO). 
DEL(D0.Se2.P2). 
CLP(D0.Se2.Pl. D0.Se0.Pl). 
SWP(D0-Se2.Pl.Si. D0.Se2Pi.S0). 
SWP(D0.Se2.Pl.S2. D0.Se2.PT.Sl). 
SPT(D0.Se2.Pl. 2). 
INS(DO.SeO.PO.SO. 2. {data}). 
SWP(DO.SeO.PO.S2. DO.SeO.PO.SO). 
SWP(D0.Se2P0. D0.Se2 P1). 
SWP(D0.Se2.Pl. D0.Se2 P2). 
SPT{D0.S62. 2). 
CLP(D0.Se0.D0.Se3), 
SWP(DO.SeO.P0. D0.SeO.P1). and 
SWP(D0.Se0. DO.Sel). 

[0074] Additionally, if partial matches of leaf nodes 
were made, the leaf nodes need to be updated using 
UPD operations. 

[0075] The above process requires all nodes in T2 
be visited and matched with corresponding nodes in Ti 
once. The complexity of matching the internal nodes is 
0(nun2). where nl and n2 are the internal node 
counts of Tl and T2. respectively. Note that nodes can 
be matched by hashing node value identifiers. 
[0076] Node movements and modifications also 
add to the overhead. If we consider a cost-based analy- 
sis, the cost of a transformation operation on a node T 
is a function of the number of children of 't'. Thus, the 
net cost of all transformations will be a function of the 
total number of nodes involved directly or indirectly in 
the transformation. 

[0077] Since there are no cydes in the transforming 
operations, the overhead contributed by the node move- 
ments is bounded by 0(LK). where L is the number of 
levels in the tree, and K is a the number of leaf nodes. 
However, typically the number of nodes Involved In the 
movements is very small and does not involve all the 
nodes in a tree. 

(0078) Hence, the worst case time complexity of the 
algorithm is a summation of the cost of matching leaf 
nodes O(K^). the cost of matching internal nodes 
0(nUn2). and overhead contributed by node move- 
ments 0(LK), In an average case analysis, where the 
number of changes to a document are less than, for 



example. 20%. the time cornplexity is a summation o(. 
the cost of matching ieai nodes 0(K). .the cost of match- 
ing internal nodes 0(m+n2). and overhead contributed 
by node movements 0(K). 

5 

optimizations 

[0079] There exist a number of additional optimlia- 
tions that can be applied to the above process. 

10 

While trying to i\nd a parent *p' in Ti ' which has the 
most children in the SIBLING^LIST, if there is tie. 
choose a parent with the same tag-type as the one 
in T2. 

75 " While re-ordering nodes within the same parent 
(intra-node movement) through swap operations, if 
the node being moved out Is not in the 
SIBLING^LISI it can be directly moved to be the 
right-most child. 

30 • While re-ordering nodes within the same parent 
{intra-node movement) through swap operations, if 
the node being moved out is in the SIBLING^LIST, 
try to position the node being moved out through 
another swap operation. 

25 

[0080] The foregoing descriptions of embodiments 
of the invention have been presented for purposes of 
illustration and description only. They are not Intended 
to be exhaustive or to limit the invention to the forms dis- 
3D Closed. Many modifications and variations will be appar- 
ent to practitioners skilled in the art. Accordingly the 
above disclosure is not intended to limit the invention. 
The scope of the Invention is defined by the appended 
daims. 

35 

Claims 

1 . A method for propagating changes in hierarchically 
organized data located on a server (204) to a copy 
40 of the data (207. 209. 21 1. 213) located on a client 
(205, 208. 210, 212). comprising: 

receiving (402) the changes (230) to the data 
(214) on the server (204); 
t5 applying (404) the changes to the data (2 1 4) on 

the server (204); and 

responsively to an event on the server and 
independently of the client, propagating the 
changes to the copy of the data by. 

50 

determining differences (406) between the 
data (214) on the server (204) and the 
copy of the data which the server has 
locally available. 
55 using the differences to construct (408) an 

update (216. 218. 220. 222) lor the copy of 
the data which the server has locally avail- 
able, wherein the update may include node 
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insertion and node deletion operations for 
hierarchically organized nodes in the data, 
and 

sending (410) the update (216. 218. 220. 
222) from I he server (204) to the ciient s 
(206. 208. 210. 212). 

2. The method of claim 1. v/herein the acts of deter- 
mining (406) the differences and constructing (408) 

the update take place during a single pass through io 
the data. 

3. The method of claim 1 or claim 2. wherein the event 
on the server includes a timer completing a prepro- 
grammed time interval, ts 

4. The method of any one of claims 1 to 3, v^/herein the 
event on the server includes a change to the data 
located on the server. 

5. The method of any one of claims 1 to 4 , wherein the 
act of propagating the changes to the copy of the 
data on the ciient takes place automatically at regu- 
lar time intervals or at irregular time inten/als. 

25 

6. The method of any one of claims 1 to 5. wherein the 
act of determining the differences (406) Involves 
aggregating the changes to the data and/or exam- 
ining the data after the changes have been applied 

to the data. so 

7. The method of any one of claims 1 to 6. wherein the 
update (216. 218, 220, 222) includes a Multipur- 
pose Internet Mail Extensions (MIME) content type 
specifying that the update (216.218, 220, 222) con- 3S 
tains updating operations for hierarchically organ- 
ized data. 

8. The method of any one of claims 1 to 7, wherein the 
update may include one or more of the following: <o 

a) a node copy operation that makes an identi- 
cal copy of a node as well as any subtree of the 
node that may exist. 

b) a node move operation that moves a node to -^s 
another location in a tree of hierarchically 
organized nodes. 

c) a node split operation that splits a node into 
a pair of nodes, and divides any children of the 
node that may exist between the pair of nodes, so 

d) a node collapse operation that collapses a 
pair of nodes into a single node, which inherits 
any children of the pair of nodes that may exist. 

e) a node deletion operation that includes 
deleting any nodes that are subordinate to the 65 
node. 

f) a node swap operation that swaps two nodes 
as well as any subtrees of the nodes that may 



exist. 

g) a node update operation. 

9. The method of any one of claims 1 to 8. wherein the 
data tnai is hierarchically organized includes data 
that conforms to the HyperText Markup Language 
(HTML) standard or the Extensible Markup Lan- 
guage (XML) standard. 

10. The method of any one of claims 1 to 9. wherein the 
data that is hierarchically organized includes a hier- 
archical database, and/or a directory service that 
supports a hierarchical name space. 

11. The method of any one of claims 1 to 10. wherein 
the copy of the data located on the dienl (206. 208, 
210. 212) contains a subset of the data (214) on the 
server (204). 

12. The method of any one of claims 1 to 1 1. wherein 
the server (204) includes a proxy server for caching 
data In transit between a server (204) and a client 
(206. 208.210. 212). 

13. The method of any one of claims 1 to 12. wherein 
the update (216, 218. 220. 222) includes data that 
is validated at the sender (204). 

14. A computer readat^le storage medium storing 
instructions that when executed by a computer 
cause the computer to perform the method of any 
one of claims 1 to 13. 

15. An apparatus that propagates changes in data 
located on a server to a copy of the data located on 
a client, comprising; 

a receiving mechanism (205) that receives 
(402) the changes to the data on the server; 
a change application mechanism (205) that 
applies the changes (404) to the data on the 
server; 

a difference determining mechanism that 
determines differences (406) between the data 
on the server and the copy of the data which 
the server has locally available, wherein the dif- 
ference determining mechanism operates 
responsiveiy to an event on the server and 
independently of the client: 
an update creation mechanism that constructs 
(408) an update for the copy of the data, 
wherein the update may include node insertion 
and node deletion operations for hierarchically 
organized nodes in the data; and 
an update sending mechanism, that sends 
(410) the update from the server to the client. 

16. The apparatus of claim 15, wherein the difference 
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determining mechanism and the update creation 
mechanism operate concurrently dunng a single 
pass through the data. 

17. The apparatus oi claim 15 or dalm 16. further com- 
prising an updating mechanism on the client that 
applies the update to the copy of the data to pro- 
duce an updated copy of th« data. 

18. The apparatus of any one of claims 15 to 17. 
wherein the update additionally includes at least 
one from the group of node move, node collapse, 
node split and node update operations. 

19. The apparatus of any one of claims 15 to 18. 
wherein the update includes a Multipurpose Inter- 
net Mail Extensions (MIME) content type specifying 
that the update contains updating operations for 

hierarchically organized data. 

20. The apparatus of any one of claims 15 to 19. 
wherein the copy of the data located on the client 
(206. 208. 210. 212) contains a subset d the data 
on the server (204). 

21. The apparatus of any one of claims 15 to 20, 
wherein the update includes data that is validated 
at the server (204). 

22. A method for propagating changes in daia located 
on a server to a copy of the data located on a client, 
comprising: 

receiving (410) at the client an update (2 16. 
218. 220. 222) tor the copy of the data from the 
server (204), wherein the update may include 
node insertion and node deletion operations for 
hierarchically organized nodes in the data: 
applying (412) the update to the copy of the 
data to produce an updated copy of the data. 

23. The method of claim 22. wherein the update addi- 
tionally includes at least one from the group of, 
node move, node collapse, node split and node 
update operations. 

24. The method of claim 22 or claim 23. wherein the 
update includes a Multipurpose Internet Mail Exten- 
sions (MIME) content type specifying that the 
update contains updating operations for hierarchi- 
cally organized data. 

25. The method of any one of claims 22 to 24. wherein 
the copy of the data located on the client contains a 
subset of the data on the server. 

26. The method of any one of claims 22 to 25. wherein 
the update includes data that is validated at the 



server. 

27. The method of any one of claims 22 to 26. wherein 
the act ol applying the update to the copy of the 
5 data takes place in semiconductor memory, 
whereby the update is able to proceed rapidly in the 
absence of time-consuming i/0 operations. 
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